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Functional  voice  outcomes  after 
thyroidectomy:  An  assessment  of  the 
Dsyphonia  Severity  Index  (DSI) 
after  thyroidectomy 


Leonard  R.  Henry,  Leah  B.  Helou,  MA,  CCC-SLP,'*  Nancy  Pearl  Solomon,  PhD,  CCC-SLP,® 

Rohin  S.  Howard,  MA,^  Joyce  Gurevich-Uvena,  MA,  CCC-SLP,®  George  Coppit,  MD,®  ared  Alexander 
Stojadinovic,  Bethesda,  MD,  and  Washington,  DC 

Background.  The  Dysphonia  Severity  Index  (DSI)  is  an  objective  multiparametric  acoustic  calculation  of 
vocal  function;  however,  its  changes  after  thyroidectomy  have  not  yet  been  described. 

Methods.  Patient-reported  symptoms,  as  well  as  auditory  perceptual,  acoustic,  and  videolaryngostrobo- 
scopic  (VLS)  data,  were  collected  prospectively  before  and  after  thyroidectomy.  Voice  outcomes  (normal 
versus  negative  voice  outcome  [NVO])  at  6  months  after  thyroidectomy  were  based  on  a  combination  of 
voice  symptoms  and  objective  findings.  The  DSI  was  assessed  over  the  perioperative  course,  and 
differences  were  determined  with  Wilcoxon  signed  rank  tests.  The  DSI  was  compared  between  study 
groups  (normal  versus  NVO)  using  t  tests,  analyses  of  variance  (ANOVAs),  or  rank  sum  tests  as 
appropriate.  The  predictive  value  of  DSI  for  long-term  voice  dysfunction  was  assessed  by  an  area  under 
the  receiver  operating  characteristics  curve  analysis.  Correlations  betiveen  DSI  and  Consensus  Auditory 
Perceptual  Ratings  of  Voice  ( CAPE-V)  and  the  patient  reported  Voice  Handicap  Index  (VHI)  were 
determined  with  Pearson ’s  correlation  coefficients. 

Results.  In  all,  62  patients  were  evaluated  before,  1-4  weeks  after,  and  6  months  after  thyroidectomy. 
Eight  (13%)  patients  were  diagnosed  with  NVO  at  6  months.  The  DSI  was  different  postoperatively 
between  NVO  and  normal  voice  fP  =  .005,  repeated  measures  [RMj-ANOVA),  with  the  NVO  group 
demonstrating  a  lesser  DSI  value  and  greater  change  from  pre-operative  assessment  at  the  first  postop¬ 
erative  visit  when  compared  with  the  normal  group  (P  <  .006  each).  The  DSI  differed  significantly  for 
pre-operative  and  6-month  assessments  according  to  sex,  smoking  status,  and  age.  Short-term  postop¬ 
erative  DSI  (area  under  the  curve  [AUC]  =  0. 795)  and  DSI  change  from  baseline  to  1-4  weeks 
(AUC  =  0.835)  were  highly  predictive  of  6-month  NVO.  DSI  measurements  over  the  post-thyroidectomy 
course  were  correlated  poorly  to  moderately  (maximum  r  =  -0.62)  with  CAPE-V  and  VHI  assessments  for 
the  same  time  points. 

Conclusion.  The  DSI  is  decreased  in  the  early  post-thyroidectomy  period,  mostly  in  persons  who  were 
ultimately  found  to  have  a  long-term  NVO.  Early  postoperative  DSI  and  change  of  DSI  from  baseline  at 
1-4  weeks  postoperation  predict  long-term  post-thyroidectomy  voice  dysfunction.  The  modest  correlations 
between  the  DSI  and  other  vocal  assessments  point  to  the  utility  of  DSI  as  an  independent  predictor  of 
voice  dysfunction  after  thyroidectomy,  which  can  select  patients  who  may  benefit  from  voice  therapy. 
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Altered  voice  is  a  common  problem  after  thyroid 
surgery.  Previous  studies  demonstrate  that  25%  to 
almost  90%  of  patients  report  abnormal  voice 
within  the  first  few  weeks  after  operation,  and 
11-15%  of  patients  report  persistent  voice  prob¬ 
lems  3-6  months  after  thyroidectomy.^'*^  Certainly, 
major  laryngeal  nerve  injuries  account  for  a  large 
portion  of  this  incidence  and  are  well-established 
and  feared  complications  of  thyroidectomy,®'^^  but 
many  patients  suffer  long-term  negative  voice  out¬ 
comes  (NVOs)  and  have  no  evidence  of  laryngeal 
nerve  injury.^^  Various  non-neurogenic  mechanisms 
have  been  postulated  to  account  for  post-thyroidec- 
tomy  voice  changes,  including  the  effects  of  endo¬ 
tracheal  intubation^^  and  other  alterations  in 
normal  anatomy  and  mechanical  factors. 

The  ability  to  identify  early  changes  in  voice 
indicative  of  durable  dysfunction  (both  neuro¬ 
genic  and  non-neurogenic)  would  facilitate  early 
referral  for  comprehensive  voice  evaluation  aimed 
to  improve  quality  of  life,  prevent  secondary 
injuries,  and  identify  patients  who  might  benefit 
from  vocal  fold  augmentation.  Predictive  abilities 
of  all  postoperative  surveillance  strategies  (the 
optimal  1  yet  to  be  defined)  remain  under  the 
influence  of  a  myriad  of  factors,  which  include 
the  reliance  on  the  patient’s  report  of  symptoms, 
the  surgeon’s  awareness  of  dysphonia,  and  his  or 
her  auditory  perception  and  expertise  in  interpret¬ 
ing  laryngoscopic  findings.  In  contrast,  acoustic 
measurements  are  objective  and  relatively  easy  to 
obtain  and  interpret. 

Wuyts  et  al^®  developed  a  multivariate  model  of 
acoustic  measures  obtained  from  patients  with  and 
without  vocal  pathology,  in  which  parameters  were 
weighted  to  create  an  index  of  vocal  function.  This 
objective  measure,  which  was  termed  the  Dyspho¬ 
nia  Severity  Index  (DSI),  incorporated  measure¬ 
ments  of  maximum  phonation  time  (MPT,  in 
seconds),  highest  vocal  frequency  (FO-high, 
Hertz),  lowest  vocal  intensity  (Flow,  decibels)  and 
percent  jitter  (a  measure  of  vocal  frequency  varia¬ 
bility)  .  The  DSI  is  scored  from  -5  to  +5,  where  4-5 
indicates  a  normal,  healthy  voice,  and  -5  repre¬ 
sents  a  severely  pathologic  voice.  DSI  measures 
have  been  shown  to  be  highly  related  to  clini¬ 
cian-perceived  and  patient-reported  severity  of 
voice  handicap. Similarly,  the  DSI  has  been 
used  clinically  as  a  tool  to  evaluate  the  effectiveness 
of  voice  therapy  and  training,  as  well  as  to  compare 
voice  quality  among  groups  with  laryngeal  pathol- 
ogy.  '  The  DSI  decreases  with  age,  which  re¬ 
flects  the  natural  deterioration  of  vocal  function 
with  increasing  age.  The  DSI  can  identify  patients 
with  dysphonia  caused  by  both  organic  and 


functional  voice  disorders  and  to  discriminate 
voice  pathology  resulting  from  lesions  or  vocal 
fold  paralysis  or  paresis. The  DSI  has  high  inter¬ 
observer  reliability,^^  and  test-retest  variability  re¬ 
sults  confirm  that  changes  in  DSI  >2.49  within  a 
given  patient  are  clinically  important.®^ 

Given  the  assumption  that  the  DSI  is  an  objec¬ 
tive,  reliable,  and  reproducible  assessment  of  voice 
quality  across  clinicians  and  centers,  the  applica¬ 
tion  of  DSI  to  evaluate  and  quantify  voice 
disturbances  after  thyroidectomy  is  appealing  con¬ 
ceptually.  The  goal  of  this  study  was  to  assess  vocal 
function  using  the  DSI  and  to  determine  the  range 
of  quantifiable  changes  that  can  be  expected  in 
patients  undergoing  thyroidectomy.  In  addition, 
we  aimed  to  determine  whether  the  DSI  has 
clinical  utility  for  identifying  patients  likely  to 
have  long-term  vocal  problems  after  thyroidec¬ 
tomy.  Finally,  we  correlated  the  DSI  with  expert 
perceptual  assessments  of  dysphonia  (Consensus 
Auditory  Perceptual  Evaluation  of  Voice  [CAPE- 
V] )  and  patient  reported  assessment  of  perceived 
voice  handicap  (Voice  Handicap  Index  [VH!]).^"^ 

METHODS 

Participants.  Patients  with  benign  thyroid  pa¬ 
thology  or  primary  nonanaplastic  thyroid  carci¬ 
noma,  who  were  scheduled  to  undergo  thyroid 
resection  under  general  or  regional  anesthesia, 
were  included  in  this  study  approved  by  the  Insti¬ 
tutional  Review  Board  of  the  Walter  Reed  Army 
Medical  Center.  Those  patients  who  had  under¬ 
gone  thyroidectomy  or  other  neck  operation  pre¬ 
viously,  as  well  as  patients  with  a  documented  voice 
disorder,  were  excluded. 

Recorded  data.  At  each  of  4  time  points 
(prethyroidectomy,  1-4  weeks  (mean,  14  days),  3 
months,  and  6  months  post-thyroidectomy),  par¬ 
ticipants  underwent  multiparametric  voice  testing. 
Each  testing  session  included  high-quality  voice 
recording,  collection  of  acoustic  measures,  voice 
expert  perceptual  analysis  of  voice  quality  employ¬ 
ing  the  CAPE-V  to  be  used  for  acoustic  and 
auditory-perceptual  analysis,  assessment  of  laryn¬ 
geal  structure  and  function  via  indirect  videolar- 
yngostroboscopy  (VLS),  and  collection  of  patient 
report  of  perceived  voice  handicap  using  the  VHI. 

Acoustic  recordings  were  collected  by  an  expe¬ 
rienced  speech-language  pathologist  (SEP)  using  a 
commercially  available  acoustic  analysis  system 
(Computerized  Speech  Lab  4500;  KayPENTAX, 
Lincoln  Park,  NJ).  Jitter  obtained  from  a  2-s 
steady-state  portion  of  a  3-4  s  sustained  /a/  pro¬ 
duced  with  habitual  voice  quality,  and  the  longest 
of  3  maximum  phonation  trials  were  included  in 
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the  DSI  equation.  In  addition,  the  highest  fre¬ 
quency  (Hz)  and  lowest  intensity  (dB)  were  eli¬ 
cited  by  the  clinician  across  several  trials  to  ensure 
that  the  production  was  representative  of  the 
subject’s  ability  at  each  time  point.  Measures  for 
each  parameter  were  entered  into  a  database,  and 
the  DSI  was  calculated  as  described  previously  by 
Wuyts.^*’ 

An  automated  version  of  the  CAPE-V  was  used 
to  present  all  voice  recordings  in  a  randomized, 
blinded  fashion.^’’  Listeners  were  exposed  to  sus¬ 
tained  vowels  (/i/  and  /a/)  and  the  6  sentences 
from  the  CAPE-V  in  any  order  and  as  many  times 
as  desired;  the  ratings  were  made  by  clicking  on 
a  defined  linear  region  for  each  CAPE-V  parame¬ 
ter.  The  voice  samples  were  grouped  by  sex  and 
presented  to  the  listener  with  a  sex-matched  audi¬ 
tory  anchor  that  was  used  at  each  listener’s  discre¬ 
tion.  Three  SLPs  with  experience  in  voice 
disorders  rated  all  voice  samples,  and  the  median 
of  the  ratings  from  all  3  SLPs  for  each  parameter 
of  the  CAPE-V  were  recorded  as  the  auditory  per¬ 
ceptual  measure  of  voice  quality. 

At  each  of  the  study  time  points,  the  patient’s 
perceived  impact  of  voice  handicap  was  reported 
by  completion  of  the  VHI.  The  VHI  is  a  30- 
question  validated  assessment  of  vocal  handicap, 
which  is  stratified  into  the  domains  of  physical, 
functional,  and  emotional  impact  of  voice  deficit, 
as  determined  by  the  individual  patient. 

Imaging  of  the  larynx  during  phonation  and 
nonphonatory  tasks  was  performed  with  a  fiber¬ 
optic  flexible  laryngoscope  passed  transnasally, 
using  halogen  and  stroboscopic  light  sources.  A 
standardized  vocal  task  protocol  during  VLS  was 
employed.  Patients  who  demonstrated  notable 
laryngeal  abnormalities  were  diagnosed  by  a  con¬ 
sensus  committee  of  SLPs,  surgical  oncologists, 
and/ or  otolaryngologists,  all  of  whom  had  experi¬ 
ence  in  voice  disorders. 

Criteria  for  NVO.  The  combined  clinical  exper¬ 
tise  and  experience  of  our  research  team  (3  SLPs,  2 
surgical  oncologists,  and  2  otolaryngologists— all 
with  substantial  clinical  and  research  voice  experi¬ 
ence),  the  review  of  evidence-based  research,  and 
the  theoretic  understanding  of  voice  disorders 
were  incorporated  to  define  voice-specific  out¬ 
comes.  NVO  was  classified  as  objective  or  subjec¬ 
tive.  Those  patients  included  in  the  objective 
NVO  category  demonstrated  structural  or  physio¬ 
logic  laryngeal  abnormalities  evident  on  VLS,  com¬ 
bined  with  an  increase  in  VHI  score  &20  points 
from  preoperative  baseline  and/or  an  increase  in 
overall  severity  on  the  CAPE-V  ^11  points  from 
baseline.  Subjective  NVO  was  assigned  in  the 


Table  I.  Study  group  characteristics 


Number  (%) 


Age,  mean  years  ±  SD 

51  ±13 

Volume  thyroid  resected, 

63  cm^  (5-2,872) 

median  cm’,  (range) 

Maximum  tumor  size. 

1.3  (0.1-8.0) 

median  cm  (range) 

Sex 

Male 

28  (45%) 

Female 

34  (55%) 

Ethnicity 

Caucasian 

41  (66%) 

African  American 

15  (24%) 

Other 

6  (10%) 

Smoker 

Never 

41  (66%) 

Former 

18  (29%) 

Current 

3  (5%) 

Voice  professional 

Yes 

15  (24%) 

No 

47  (76%) 

Procedure 

Total 

37  (60%) 

Lobe 

25  (40%) 

Histology:  malignant 

Yes 

25  (40%) 

No 

37  (60%) 

SD,  Standard  deviation. 


absence  of  VLS  anatomic  or  physiologic  derange¬ 
ment  when  2  or  more  of  the  following  criteria 
were  met:  VHI  increase  &20  points  from  preopera¬ 
tive  baseline,  patient  report  of  atypical  voice  com¬ 
pared  with  pre-operative  status,  and  CAPE-V 
overall  severity  increase  £ll  points  from  preopera¬ 
tive  values.  All  patients  not  classified  as  having  a 
subjective  or  objective  NVO  were  categorized  as 
normal. 

Statistical  methods.  Comparisons  of  DSI 
changes  within  groups  over  time  were  assessed 
with  Wilcoxon  signed  rank  tests.  Comparisons  of 
DSI  and  cohort  characteristics,  as  well  as  compar¬ 
isons  of  raw  DSI  scores  and  DSI  changes  from 
preoperative  baseline  at  the  specified  time  points 
in  patients  with  and  without  NVO,  were  made  with 
t  tests,  ANOVAs,  or  Wilcoxon  rank  sum  tests  as  ap¬ 
propriate.  The  estimated  areas  under  the  curve 
(AUCs)  of  the  receiver  operating  characteristics 
(ROC)  curves  were  determined  for  the  DSI  at  1- 
4  weeks  postoperation  and  for  the  DSI  change 
from  preoperative  baseline  at  1-4  weeks  in  predict¬ 
ing  long-term  NVO  at  6  months  after  thyroidec¬ 
tomy.  The  correlations  of  DSI  with  CAPE-V  and 
VHI  measurements  at  the  specified  time  points 
were  assessed  with  Pearson  coefficients.  Statistical 
tests  were  conducted  with  SPSS  software  version 
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Table  II.  Study  group  characteristics  and  DSI  at  baseline  and  6  months  after  thyroidectomy 


Pre-operative  DSI 

P  value 

6-month  DSI 

P  value 

Change 

P  value 

Sex 

Male 

-0.12  (±  3.24) 

<.005* 

-0.24  (±  2.55) 

<.005* 

-0.11(±  2.40) 

NS 

Female 

2.62  (±  2.34) 

2.31  (±  2.24) 

-0.30(±  1.88) 

Ethnicity 

Caucasian 

1.19  (±  3.17) 

NS 

1.14  (±  2.84) 

NS 

-0.04  (±  1.86) 

NS 

African  American 

1.38  (±  3.36) 

0.86  (±  2.33) 

-0.52  (±  2.79) 

Other 

2.73  (±  1.21) 

2.06  (±  2.67) 

-0.67  (±  2.04) 

Smoker 

Current 

-1.22  (±  2.18) 

.03t 

-0.16  (±  2.05) 

.003) 

1.06  (±  0.16) 

NS 

Former 

0.22  (±  3.84) 

-0.45  (±  2.25) 

-0.67  (±  2.80) 

Never 

2.08  (±  2.52) 

1.97  (±  2.59) 

-0.11  (±  1.80) 

Voice  professional 

Yes 

1.21  (±  3.39) 

NS 

1.78  (±  3.13) 

NS 

0.56  (±  2.04) 

.10 

No 

1.43  (±  3.01) 

0.97  (±  2.54) 

-0.47  (±  2.10) 

Procedure 

Total 

1.37  (±  2.98) 

NS 

1.41  (±  2.45) 

NS 

0.04  (±  2.06) 

NS 

Lobe 

1.41  (±  3.29) 

0.81  (±  3.02) 

-0.60  (±  2.18) 

Histology:  malignant 

Yes 

1.31  (±  2.95) 

NS 

0.98  (±  2.03) 

NS 

-0.33  (±  2.37) 

NS 

No 

1.43  (±  3.21) 

1.29  (±  3.08) 

-0.14  (±  1.95) 

Histology:  benign  inflammatory 

Yes 

1.62  (±  3.06) 

NS 

1.47  (±  2.90) 

.11 

-0.15  (±  1.93) 

NS 

No 

0.80  (±  3.14) 

0.42  (±  1.96) 

-0.38  (±  2.56) 

Histology:  benign  noninflammatory 

Yes 

1.86  (±  2.55) 

NS 

2.10  (±  2.24) 

.12 

0.25  (±  1.69) 

NS 

No 

1.23  (±  3.24) 

0.86  (±  2.77) 

-0.37  (±  2.23) 

*Twosample  t  test. 
tANOVA. 

NS,  Nonsignificant,  P  >  .2. 


Table  III.  Correlations  between  study  group  characteristics  and  DSI  at  baseline  and  6  months  after 
thyroidectomy 


Pre-operative  DSI 

P  value 

6-month  DSI 

P  value 

Change 

P  value 

Age 

-0.49 

<.005* 

-0.48 

<.005* 

0.10 

.44 

Volume  thyroid  resected  (cm"*) 

-0.16 

.22 

-0.06 

.65 

0.15 

.24 

Maximum  tumor  size  (cm) 

-0.04 

.83 

-0.20 

.34 

-0.12 

.59 

*Pearson’s  correlation  coefficient. 


16  (SPSS  Inc.,  Chicago,  IL).  For  all  tests,  P<  .05 
was  considered  significant. 

RESULTS 

In  all,  62  patients  who  underwent  thyroidec¬ 
tomy  were  followed  with  comprehensive  voice 
assessment  before,  1-4  weeks  post-thyroidectomy, 
and  6  months  post-thyroidectomy.  Eight  of  62 
(13%)  patients  were  classified  at  6  months  as 
having  NVO;  of  these  patients,  6  had  subjective 
NVO  and  2  had  objective  NVO.  Both  patients  in 
the  objective  NVO  group  demonstrated  persistent 
signs  of  recurrent  laryngeal  nerve  paralysis.  Sub¬ 
jective  and  objective  NVO  patients  were  grouped 
into  1  NVO  cohort  for  statistical  analysis.  The 


clinical  characteristics  of  the  study  population  are 
summarized  in  Table  I.  Changes  in  DSI  at  6 
months  post-thyroidectomy  relative  to  the  pre-op¬ 
erative  baseline  were  compared  across  the  clinical 
characteristics  of  the  study  population;  significant 
differences  according  to  their  sex,  smoking  history, 
and  age  were  revealed  (Tables  II  and  III).  Women, 
patients  who  had  never  smoked,  and  younger  pa¬ 
tients  had  a  higher  (better)  DSI  at  the  pre-opera¬ 
tive  and  6  month  postoperative  time  points,  but 
the  degree  of  change  from  baseline  was  not  differ¬ 
ent  between  these  groups. 

The  DSI  was  less  in  the  entire  cohort  at  the  early 
postoperative  visit  (7^=. 008),  after  which  changes 
from  baseline  were  not  significant.  When  separated 
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Table  IV.  Values  of  DSI*  over  the  course  of  thyroidectomy  (pre-operatively  to  6  months  postoperatively) 

DSI 


Pre-operatively 

1-4  weeks 

3  months 

6  months 

Normal 

NVO 

2.05  (-6.40-6.83) 
1.45  (-1.94-3.93) 

0.88  (-8.77-7.90) 

-4..30  (-19.10-1.90) 

0.94  (-4.54-5.38) 
-0.89  (-3.43-2.25) 

1.17  (-6.21-6.73) 
0.04  (-1.65-2.24) 

Change 

'  in  DSI 

1-4  weeks) 

3  months 

6  months) 

Normal 

NVO 

P  value 

.75 

-0.64  (-14.72-5.05) 

-5.75  (-18.87-1.65) 

.006* 

■0.70  (-4.63-4.23) 

■2.37  (-4.83-2.00) 

.10 

-0.21  (-6.36-4.81) 
-1.19  (-4..54-1.19) 
.15 

*Median  (range)  per  group. 
fWilcoxon  rank  sum  test. 


ROC  Curve 


Fig  1.  ROC  curve  (AUC  =  0.835)  for  DSI  change  at 
1  week  from  baseline  in  predicting  an  NVO  at  6  months 
(P=  .002). 

hy  group,  however,  only  the  NVO  group  changed 
signihcantly  from  baseline.  The  DSI  over  the  post¬ 
operative  course  in  patients  with  and  without 
NVO  were  different  from  each  other  (F  <  .005) 
with  regard  to  the  absolute  DSI  score  at  1  week  post¬ 
thyroidectomy  (Table  IV;  F=  .006),  and  with  regard 
to  the  degree  of  change  in  DSI  at  1  week  from  base¬ 
line  (Table  IV;  P=.001). 

Both  the  absolute  DSI  score  at  1-4  weeks  after 
thyroidectomy  and  the  DSI  change  from  pre-oper¬ 
ative  baseline  at  the  Hrst  postoperative  visit  pre¬ 
dicted  NVO  at  6  months  with  area  under  the  ROC 
curve  of  0.795  (F  =  .008)  and  0.835  (F  =  .002),  re¬ 
spectively.  Figure  1  shows  the  ROCs  for  DSI  change 
from  baseline  at  1-4  weeks  after  thyroidectomy. 
The  sensitivity,  specihcity,  and  positive  and  nega¬ 
tive  predictive  values  of  various  DSI  changes  from 


pre-operative  baseline  thresholds  (-1.5  to  -3.5)  to 
1-4  weeks  after  thyroidectomy  are  shown  in  Fig 
2.  Lesser  changes  from  baseline  at  1-4  weeks 
post-thyroidectomy  are  more  sensitive  for  identify¬ 
ing  long-term  NVO,  whereas  greater  changes  from 
the  baseline  are  more  specific. 

Absolute  DSI  measurements  and  changes  in  DSI 
from  pre-operative  baseline  at  hrst  postoperative 
visit  correlated  with  VHI  and  CAPE-V  measure¬ 
ments  of  overall  severity  at  the  pre-operative  and 
postoperative  time  points  (Table  V) .  Several  statis¬ 
tically  signihcant  correlations  were  identihed,  with 
the  strongest  of  these  being  an  inverse  correlation 
between  the  change  in  total  VHI  score  at  1-4  weeks 
from  pre-operative  baseline  to  the  calculated 
change  in  DSI  at  1-4  weeks  (r=-0.62,  F<  .005). 

DISCUSSION 

Early  recognition  and  intervention  for  durable 
voice  problems  after  thyroid  surgery  can  lead  to 
efforts  aimed  at  improving  quality  of  life,  prevent¬ 
ing  maladaptive  behaviors  that  might  produce 
secondary  injury,  and  identifying  appropriate  pa¬ 
tients  for  consideration  for  vocal  fold  augmenta¬ 
tion  to  improve  voice  and  mitigate  aspiration  risk. 
The  ideal  surveillance  strategy  to  identify  patients 
at  risk  of  long-term  voice  problems  after  thyroid¬ 
ectomy,  however,  remains  elusive.  The  DSI  repre¬ 
sents  a  relatively  easily  obtained  and  serviceable 
measurement  of  voice  toward  that  end.  To  our 
knowledge,  this  article  is  the  hrst  to  evaluate  the 
DSI  in  the  context  of  post-thyroidectomy  functional 
voice  assessment.  In  this  study,  we  determined  the 
DSI  in  a  cohort  of  padents  who  underwent  thy¬ 
roidectomy  and  explored  the  utility  of  early  post¬ 
operative  DSI  changes  (determined  at  1-4  weeks, 
relative  to  pre-operative  baseline)  in  predicting 
long-term  NVOs  (at  the  sixth  postoperahve  month). 
In  addidon,  we  evaluated  whether  the  raw  (ie. 
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Fig  2.  Sensitivity,  specificity,  and  positive  and  negative  predictive  values  (y-axis)  of  DSI  change  at  1-4  weeks  from 
baseline  (x-axis)  in  predicting  an  NVO  at  6  months  post-thyroidectomy. 


Table  V.  Pearson’s  correlation  coefficients  pre-operatively  at  1  week,  3  months,  and  6  months 
postoperatively  (P  value)  between  absolute  CAPE-V/Total  VHI  and  DSI 


Pre  DSI 

Postl  DSI 

Post2  DSI 

Posts  DSI 

CAPE-V  severity 

Total  VHI 

-0.295  (0.02) 
-0.100  (0.44) 

-0.528  (<.005) 
-0.620  (<.005) 

-0.255  (.055) 
-0.235  (.076) 

-0.322  (.Oil) 
-0.198  (.12) 

Postl  DSI  change 

Post2  DSI  change 

Posts  DSI  change 

CAPE-V  severity  change 
Total  VHI  change 

-.580  (<0.005) 
-.601  (<0.005) 

-.329  (.012) 

-.272  (.039) 

-.180  (.17) 
-.211  (.099) 

Voice  outcomes  assessment  tools  and  NVO. 


absolute)  DSI  scores  or  changes  from  preoperative 
baseline  DSI  scores  determined  during  the  post¬ 
operative  follow-up  period  correlated  with  other 
measures  related  to  vocal  outcome  currently  in 
clinical  use  (ie,  the  patient-reported  VHI  and 
perceptual  ratings  of  voice  quality  via  the  CAPE-V) . 

Age,  sex,  and  smoking  status  were  identihed  as 
clinically  relevant  determinants  of  DSI  in  this 
patient  population.  At  the  first  postoperative  visit, 
the  absolute  DSI  and  the  change  from  preoperative 
baseline  were  both  signihcantly  different  in 
patients  with  normal  voice  versus  those  experienc¬ 
ing  persistent  NVO  6  months  after  thyroidectomy. 
Eurthermore,  DSI  change  from  baseline  to  the  first 
postoperative  visit  after  thyroidectomy  was 
predictive  of  which  patients  would  experience 
durable  NVO  at  6  months  post-thyroidectomy 
(AUG  =  0.835;  P=.002). 

Currently,  there  is  no  consensus  regarding  what 
constitutes  an  NVO  after  thyroidectomy.  Although 
laryngeal  nerve  injuries  are  considered  widely  to  be 
secondary  surgical  events,  the  non-neurologic 
causes  of  impaired  voice  after  thyroid  surgery  are 
often  unrecognized.  A  global  definition  of  subjec¬ 
tive  and  objective  NVOs  has  yet  to  be  established. 


Thus,  patient  complaints  that  cannot  be  linked  to 
physiologic  impairment  are  often  not  reported  in 
the  literature  or  are  variably  classihed.  In  this  study, 
we  have  also  introduced  a  multifactorial  dehnition 
for  subjective  and  objective  NVO  that  is  based  on 
years  of  collective  clinical  and  research  experience. 

Age  is  known  to  affect  the  DSI,^^  and  we  ob¬ 
served  the  same  phenomenon  in  this  study.  In  ad¬ 
dition,  patients  who  had  never  smoked  showed 
higher  (better)  DSI  values  at  the  pre-operative  ses¬ 
sion,  as  well  as  signihcantly  higher  DSI  calculations 
at  the  6-month  postoperative  time  point.  These  re¬ 
sults  are  intuitive,  because  smoking  is  known  to  de¬ 
crease  maximum  phonation  time^®  and  is  an 
established  contributor  to  dysphonia  and  other  la¬ 
ryngeal  pathology.^^  The  DSI  was  shown  originally 
to  be  independent  of  sex,  supposedly  because  the 
greater  maximum  pitch  of  the  female  voice  is  off¬ 
set  by  the  greater  maximum  phonation  time 
achieved  by  men.^®  Our  hndings  are  at  odds  with 
this  claim,  as  the  DSI  was  signihcantly  greater  in 
women  than  men  at  the  preoperative  and  6-month 
assessments,  although  the  degree  of  change  in  DSI 
from  baseline  to  any  postoperative  time  point  was 
not  different  across  sex.  One  explanation  for  this 
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Table  VI.  Pearson’s  correlation  coefficients  of  changes  from  baseline  in  GAPE-V/Total  VHI  and  DSI  at  each 

postoperative  time  point 

Abbreviation  Full  term 

Description 

Benefits 

Limitations 

DSI  Dysphonia 

An  acoustic 

Objective,  easily 

Relies  on 

Severity  Index 

calculation  of 

obtained,  reproducible 

maximal-performance 

voice  function 

and  portable;  indicated 

tasks  that  depend 

usually  measured 

to  gauge  pathology 

on  subject’s 

from  -5.0  to  5.0. 

or  document  response 

motivation 

Changes  from 

to  therapy 

and  ability 

baseline  of  2.5 

Requires 

or  greater  are 

high-quality  acoustic 

believed  to  be 

recording  equipment 

clinically  significant 

and  a  quiet  recording 
environment 

VLS  Videolaryngoscopy  or 

Recorded  examination 

Recorded  for  repeated 

Limited  by  subjective 

Videolaryngostroboscopy  of  larynx  during 

review.  Affords 

interpretations  of 

functional  activities 

detailed  examination 

sometimes  subtle 

using  both  halogen 

of  laryngeal 

findings;  stroboscopic 

and  stroboscopic 

structure  and  function 

light  source  can 

light  sources 

to  permit  anatomic 

induce  optical 

and  functional 

illusions;  light 

diagnoses 

imbalances  can 

when  present 

affect  ratings 

CAPE-V  Consensus  Auditory 

Current  rating 

Popular  assessment 

Best  reserved 

Perceptual 

of  voice  quality 

for  diagnosis  of 

for  employment  by 

Evaluation 

as  determined  by 

“dysphonia”; 

voice  professionals. 

of  Voice 

skilled  listener. 

developed  by 

Less  reproducible 

Ratings  of  overall 

a  consensus 

and  lower  inter-rater 

severity,  roughness, 

panel  of  experts. 

reliability  among 

breathiness,  strain. 

non-voice  experts. 

pitch,  and  loudness 

Eew  studies  have 

on  a  modified 

tested  its  reliability 

visual  analog  scales 

to  date 

VCH  Voice  Case  History 

Collection  of 

Easily  obtained 

May  be  influenced 

patient-reported 

by  patient-clinician 

by  patient-clinician 

specific  voice 

interaction;  can 

interaction 

symptoms:  voice 

direct  clinical 

typical,  difficulty 

management 

singing,  difficulty 
projecting, 
hoarseness,  vocal 
fatigue,  and  pain 

decisions 

VHI  Voice  Handicap 

Thirty-question 

Validated  assessment 

Not  widely 

Index 

patient-reported 

of  patient  perceived 

used  or  familiar 

assessment  of 

impact  of  voice 

to  most  surgeons 

voice-related 

deficit  on  quality 

quality  of  life 

of  life;  theoretically 

regarding  impact 

free  from  clinician 

of  voice  disorder 

influence.  Easily 

on  the  patient’s 

obtained  by  simple 

physical,  functional. 

questionnaire.  Valuable 

and  emotional 
well-being 

outcomes  measure 

(  continued ) 
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Abbreviation  Full  term 

Description 

Benefits 

Limitations 

NVO  Negative  Voice 

Novel  comprehensive 

Comprehensive 

New,  yet  to  be 

Outcome 

classification  of 

diagnostic 

validated  by  other 

voice  outcomes 

criteria  that  aims 

investigators.  Requires 

after  thyroidectomy 

to  capture  patients 

comprehensive  voice 

that  accounts  for 

with  functionally 

testing  and  expertise 

neurogenic  and 

relevant  adverse 

not  universally 

functionally  relevant 

voice  after 

available.  Not 

non-neurogenic 

thyroidectomy 

inclusive  of  all 

voice  outcomes  by 

despite  frequent 

acoustic  measures 

incorporating  VCH, 

difficulties  in 

CAPE-V,  DSI,  and 

determining 

VHI  in  addition 

precise  anatomic 

to  VLS  findings 

cause 

discrepancy  may  be  that,  in  this  study,  loudness  was 
not  controlled;  speakers  were  instructed  to  speak 
the  sustained  vowels  and  sentences  in  a  comfort¬ 
able,  typical  voice.  This  consideration  is  relevant 
because  when  adults  phonate  at  a  “comfortable 
level,”  they  often  produce  sounds  that  occur  at 
less  than  the  typical  intensity  (eg,  75  dB)  level. 

A  recent  study  by  Brockerman  et  al^^  showed  that 
the  quieter  a  man’s  phonation,  the  greater  his  level 
of  vocal  jitter.  As  jitter  levels  increase,  the  DSl  score 
decreases;  thus,  not  controlling  for  loudness  may 
have  contributed  to  the  effect  observed  according 
to  patient  sex  in  this  study.  Another  contributory 
factor  may  be  the  relatively  greater  proportion  of 
current  and  former  smokers  among  males  in  this 
population. 

Hakkesteegt  et  al^^  suggested  that  a  change  in 
DSI  of  2.49  or  more  within  an  individual  patient 
is  clinically  relevant.  Although  we  have  shown 
that  change  in  DSI  from  preoperative  values  is  pre¬ 
dictive  of  long-term  voice  problems,  we  would  ad¬ 
vocate  that  the  individual  clinician  should  judge 
at  which  threshold  change  in  DSI  should  trigger 
referrals  for  comprehensive  voice  evaluations.  Sen¬ 
sitivity,  specificity,  and  predictive  values  change  ex¬ 
pectedly  across  various  DSI  thresholds.  For 
example,  in  our  practice,  we  would  seek  a  thresh¬ 
old  with  high  sensitivity  at  the  price  of  specificity 
and  positive  predictive  value  so  as  to  help  ensure 
that  we  do  not  miss  patients  who  might  ultimately 
benefit  from  comprehensive  voice  referrals  and 
therapy.  Admittedly,  this  approach  will  result  in  a 
relatively  greater  rate  of  referrals  to  voice  special¬ 
ists.  In  contrast,  setting  the  DSI  change  threshold 
at  -3.5  in  our  study  would  result  in  only  18%  of 
our  patients  being  referred  for  comprehensive 
voice  testing  and  therapy,  of  whom  55%  (positive 


predictive  value)  were  identified  ultimately  as  hav¬ 
ing  an  NVO. 

We  have  previously  identified  that  changes  in 
the  VHI  and  CAPE-V  at  the  first  postoperative  visit 
relative  to  the  pre-operative  baseline  are  predictive 
of  dysphonia  in  patients  after  thyroidectomy.®®  In 
that  study,  we  defined  dysphonia  as  VLS  abnormal¬ 
ities  in  the  presence  of  a  patient-reported  voice- 
specific  symptom  (s).  In  the  current  study,  we  could 
not  compare  the  predictive  value  of  the  DSI  rela¬ 
tive  to  the  VHI  or  CAPE-V  because  VHI  and 
CAPE-V  measurements  were  included  in  our  re¬ 
vised,  more  inclusive  diagnostic  criteria  for  NVO. 
It  should  also  be  noted  that  the  current  CAPE-V 
data  were  collected  in  a  randomized,  blinded  fash¬ 
ion,  and  the  CAPE-V  data  previously  reported  were 
derived  by  1  voice  expert  during  the  data-collec- 
tion  session.  These  values  differ  somewhat,  as 
reported  by  Solomon  et  al.^® 

DSI  measurements  were  inversely  correlated 
witb  VHI  and  CAPE-V  measurements.  None  of 
the  statistically  significant  correlations,  however, 
reached  a  level  of  correlation  that  we  would  deem 
to  be  compelling  clinically.  This  finding  suggests 
that  the  DSI  calculation  is  not  redundant  with 
other  assessments  of  voice  quality;  rather,  the  DSI 
may  add  additional  functional  voice  data  to  a 
comprehensive  assessment  of  voice.  This  concept 
is  important  to  note,  because  DSI  is  purported  to 
be  a  measure  of  dysphonia,  and  therefore  it  should 
correlate  strongly  with  perceptual  measures  of 
dysphonia,  such  as  the  CAPE-V.  Three  of  the  4 
parameters  of  the  DSI  (quietest  phonation,  high¬ 
est  pitch,  and  MPT)  are  described  more  appropri¬ 
ately  as  measures  of  voice  function  rather  than 
measures  of  dysphonia,  which  leads  us  to  question 
its  face  validity  or  realism  as  a  measure  of  vocal 
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quality.  Our  contention  is  that  the  DSI  may  serve  as 
a  complementary  measure  to  both  perceived  dys- 
phonia  (CAPE-V)  and  functional,  voice-related 
quality  of  life  (VHI) . 

Most  patients  in  this  study  with  NVOs  at  the 
6-month  postoperative  time  point  were  classified 
as  having  subjective  NVO,  based  on  patient  voice- 
specific  complaints  in  the  absence  of  structural  or 
physiologic  laryngeal  impairment.  Some  of  those 
classified  as  subjective  NVO  may  have  had  undi¬ 
agnosed  injuries  to  the  external  branch  of  supe¬ 
rior  laryngeal  nerve  (EBSLN),  the  laryngoscopic 
findings  of  which  are  subtle  typically. Unfortu¬ 
nately,  confirmatory  cricothyroid  electromyo¬ 
graphic  (EMG;  considered  to  be  the  gold 
standard  for  diagnosis  of  EBSLN  injury)  was  not 
performed  in  this  study.  Another  explanation 
for  the  relatively  high  percentage  of  subjective 
versus  objective  NVO  in  this  study  is  that  VLS 
alone  is  insufficient  in  diagnosing  all  possible 
clinically  relevant  adverse  vocal  outcomes  after 
thyroidectomy.  Although  considered  to  provide 
physiologic  data,  VLS  is  also  subjective,  because 
the  results  are  based  on  visual-perceptual  ratings, 
and  optical  illusions  can  occur. Changes  in  DSI 
might  help  to  identify  these  hard-to-diagnose  pa¬ 
tients,  thus  adding  objective  validation  to  voice- 
specific  patient  symptoms  that  might  otherwise 
be  regarded  as  “functional”  when  unsubstanti¬ 
ated  by  other  means  of  objective  diagnostic 
evaluation. 

This  study  has  limitations  that  deserve  mention. 
The  cohort  is  relatively  small  and  few  patients 
suffered  a  durable  (6-month)  NVO  by  our  diag¬ 
nostic  criteria.  The  sample  size  of  our  study  limits 
our  ability  to  define  the  test’s  predictive  values 
precisely  at  differing  thresholds  of  change  from 
baseline.  As  mentioned  previously  laryngeal  EMG 
was  not  a  part  of  this  study  protocol.  It  is  possible 
that  some  of  our  subjective  NVO  patients  actually 
had  unsubstantiated  EBSLN  injury.  Last,  voice 
recordings  from  which  jitter  was  calculated  in 
this  study  did  not  control  for  loudness.  As  men¬ 
tioned  previously,  future  studies  intending  to  use 
jitter  as  a  valid  and  reliable  parameter  in  the  DSI 
must  control  for  loudness. 

In  conclusion,  we  have  characterized  changes  in 
the  DSI  occurring  in  patients  undergoing  thyroid¬ 
ectomy.  Ghanges  in  the  DSI  at  the  first  postoper¬ 
ative  visit  relative  to  pre-operative  baseline  are 
highly  predictive  of  patients  who  will  experience 
long-term  voice  dysfunction  after  thyroidectomy. 
We  suggest  that  these  patients  warrant  compre¬ 
hensive  diagnostic  and  therapeutic  interventions 
early  in  the  postoperative  period. 
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